Feasibility of Human-in-the-loop Minimum Error Rate Training
نویسندگان
چکیده
Minimum error rate training (MERT) involves choosing parameter values for a machine translation (MT) system that maximize performance on a tuning set as measured by an automatic evaluation metric, such as BLEU. The method is best when the system will eventually be evaluated using the same metric, but in reality, most MT evaluations have a human-based component. Although performing MERT with a human-based metric seems like a daunting task, we describe a new metric, RYPT, which takes human judgments into account, but only requires human input to build a database that can be reused over and over again, hence eliminating the need for human input at tuning time. In this investigative study, we analyze the diversity (or lack thereof) of the candidates produced during MERT, we describe how this redundancy can be used to our advantage, and show that RYPT is a better predictor of translation quality than BLEU.
منابع مشابه
Error assessment in man-machine systems using the CREAM method and human-in-the-loop fault tree analysis
Background and Objectives: Despite contribution to catastrophic accidents, human errors have been generally ignored in the design of human-machine (HM) systems and the determination of the level of automation (LOA). This paper aims to develop a method to estimate the level of automation in the early stage of the design phase considering both human and machine performance. Methods: A quantita...
متن کاملPotential Effects of Climatic Parameters on Human Brucellosis in Fars Province, Iran, during 2009-2015
Background: Human brucellosis is widespread in Fars province. The present study aimed to investigate the effect of climate on its incidence and determine the areas prone to the infection.Methods: Monthly meteorological data and the incidence rate of human brucellosis during 2009-2015 were collected and their correlation was studied using Pearson’s correlation coefficient. Additionally, th...
متن کاملCovariance Analysis of a vector tracking GPS receiver based on MMSE multiuser Detection
In high dynamic conditions, using vector tracking loops instead of scalar tracking loops in GPS receivers is proved as an efficient method to compensate the performance. The Minimum Mean Squared Error detector as a multiuser detector is applied in the vector tracking loop for more reliability and efficiency. The Kalman filter does the two tasks of tracking and extracting the navigation data aft...
متن کاملEvaluation of Human Reliability by Standardized Plant Analysis Risk HRA (SPAR-H) method in the Dialysis Process in Ebne Sina Hospital, Shiraz
Background and Objectives: Human errors in dialysis care can cause injury and death. One of the basic steps to increase reliability in this critical process is to analyze the error and identify the weaknesses of doing this process. Methods: The present study is a descriptive-analytic cross-sectional study. The SPAR-H method was used to identify and evaluate the probability of human error in th...
متن کاملInvestigation of human error by using THERP method in control room of incoiler department in a pipe manufacturing company
Background & Aims of the Study: Today, in many sensitive occupational environments, human error can lead to catastrophic events. Given that the sensitive task of a control area operator, which in the occurrence of malfunction or failure leads to irreparable events, it is important to predict human errors to reduce its adverse consequences. Therefore, the present study was perform by aiming to ...
متن کامل